Years Dataset

In this notebook I will be working with data_by_year_o.csv, a time-series dataset. I will try to learn how the other features in the dataset correlate with the year. I will be using K-means clustering as well as agglomerative clustering.

Analysing the dataset

EDA

Calculating the Spearman rank correlation coefficient to check the direction and strength of the monotonic association between the attributes year and popularity.
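The Spearman check can be sketched roughly as follows. This is a minimal illustration, not the notebook's original cell: the DataFrame is a synthetic stand-in (assumption) with only the `year` and `popularity` columns, and in practice it would be replaced by the frame loaded from data_by_year_o.csv.

```python
import numpy as np
import pandas as pd
from scipy.stats import spearmanr

# Synthetic stand-in for the real data (assumption): popularity follows a
# rising trend over the years, with some random noise on top.
rng = np.random.default_rng(0)
years = np.arange(1921, 2021)
popularity = np.linspace(0.0, 70.0, len(years)) + rng.normal(0.0, 5.0, len(years))
df = pd.DataFrame({"year": years, "popularity": popularity})

# spearmanr ranks both columns and correlates the ranks.
# rho > 0 means popularity tends to rise with the year, rho < 0 that it falls.
rho, p_value = spearmanr(df["year"], df["popularity"])
print(f"Spearman rho = {rho:.3f}, p-value = {p_value:.3g}")
```

A value of rho close to +1 or -1 indicates a strong monotonic relationship, while a value near 0 indicates little or none.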

Comparing the characteristics of music decade by decade.
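One possible way to make this comparison is to bucket the years into decades and average each feature per bucket. The sketch below uses synthetic data, and the feature names `energy` and `acousticness` are assumptions chosen for illustration, not columns confirmed by this notebook.

```python
import numpy as np
import pandas as pd

# Synthetic stand-in for the real data (assumption): two made-up audio
# features that drift in opposite directions over time.
rng = np.random.default_rng(0)
years = np.arange(1921, 2021)
n = len(years)
df = pd.DataFrame({
    "year": years,
    "energy": np.linspace(0.2, 0.7, n) + rng.normal(0.0, 0.02, n),
    "acousticness": np.linspace(0.9, 0.3, n) + rng.normal(0.0, 0.02, n),
})

# Map each year to the start of its decade, e.g. 1987 -> 1980.
df["decade"] = (df["year"] // 10) * 10

# Average every feature within each decade.
decade_means = df.groupby("decade")[["energy", "acousticness"]].mean()
print(decade_means.round(3))
```

Note that the first and last buckets may cover fewer than ten years, so their averages rest on fewer rows.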

2D PCA

Reducing the dimensionality of the dataset by projecting it onto the principal components, the directions that capture the most variance in the data.
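A minimal sketch of the 2D projection with scikit-learn is shown here. The data is synthetic and the column names `feature_0` to `feature_4` are placeholders (assumptions); standardizing before PCA is a common choice, not necessarily what the original cell did.

```python
import numpy as np
import pandas as pd
from sklearn.decomposition import PCA
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in (assumption): 100 yearly rows and 5 made-up numeric
# features that share a common trend plus noise.
rng = np.random.default_rng(0)
trend = np.linspace(0.0, 1.0, 100)
features = pd.DataFrame(
    {f"feature_{i}": trend * (i + 1) + rng.normal(0.0, 0.1, 100) for i in range(5)}
)

# Standardize first so that no feature dominates because of its scale.
scaled = StandardScaler().fit_transform(features)

# Keep the two directions of largest variance.
pca = PCA(n_components=2)
components = pca.fit_transform(scaled)

print(components.shape)
print(pca.explained_variance_ratio_)
```

The `explained_variance_ratio_` attribute reports how much of the total variance each retained component accounts for, which helps judge how much information the projection keeps.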

3D PCA

K-means clustering
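As a rough illustration of this step, the sketch below runs K-means on standardized features. The data is synthetic, and the choice of three clusters is an arbitrary assumption for the example; on the real dataset the number of clusters would need to be chosen, for instance with an elbow plot of the inertia.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in (assumption): three loose groups of rows in 4 dimensions.
rng = np.random.default_rng(0)
centers = np.array([[0, 0, 0, 0], [3, 3, 3, 3], [6, 0, 6, 0]], dtype=float)
X = np.vstack([c + rng.normal(0.0, 0.5, size=(30, 4)) for c in centers])

# K-means relies on Euclidean distance, so put features on a common scale.
X_scaled = StandardScaler().fit_transform(X)

# n_clusters=3 is an assumption made for this example only.
kmeans = KMeans(n_clusters=3, n_init=10, random_state=0)
labels = kmeans.fit_predict(X_scaled)

print(np.bincount(labels))   # number of rows assigned to each cluster
print(kmeans.inertia_)       # within-cluster sum of squared distances
```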

Agglomerative Clustering
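A comparable sketch for agglomerative clustering follows. Again the data is synthetic, and both the number of clusters (three) and the Ward linkage are assumptions made for illustration rather than settings taken from the notebook.

```python
import numpy as np
from sklearn.cluster import AgglomerativeClustering
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in (assumption): three loose groups of rows in 4 dimensions.
rng = np.random.default_rng(0)
centers = np.array([[0, 0, 0, 0], [3, 3, 3, 3], [6, 0, 6, 0]], dtype=float)
X = np.vstack([c + rng.normal(0.0, 0.5, size=(30, 4)) for c in centers])
X_scaled = StandardScaler().fit_transform(X)

# Bottom-up clustering: every row starts as its own cluster, and the two
# closest clusters are merged repeatedly until n_clusters remain.
# Ward linkage merges the pair that least increases within-cluster variance.
agg = AgglomerativeClustering(n_clusters=3, linkage="ward")
agg_labels = agg.fit_predict(X_scaled)

print(np.bincount(agg_labels))  # number of rows assigned to each cluster
```

Unlike K-means, this method has no random initialization, so repeated runs on the same data give the same labels.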